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Hadrons have finite interaction size with dense material, a basic feature common to known forms of 
hadronic calorimeters (HCAL). We argue that substructure variables cannot use HCAL information 
to access the microscopic nature of jets much narrower than the hadronic shower size, which we call 
superboosted massive jets. It implies that roughly 15% of their transverse energy profile remains 
inaccessible due to the presence of long-lived neutral hadrons. This part of the jet substructure 
is also subject to order-one fluctuations. We demonstrate that the effects of the fluctuations are 
not reduced when a global correction to jet variables is applied. The above leads to fundamental 
limitations in the ability to extract intrinsic information from jets in the superboosted regime. The 
neutral fraction of a jet is correlated with its flavor. This leads to an interesting and possibly useful 
difference between superboosted W/Z/h/t jets and their corresponding backgrounds. The QCD jets 
that form the background to the signal superboosted jets might also be qualitatively different in 
their substructure as their mass might lie at or below the Sudakov mass peak. Finally, we introduce 
a set of zero-cone longitudinal jet substructure variables and show that while they carry information 
that might be useful in certain situations, they are not in general sensitive to the jet substructure. 


There are several reasons for why the physics of highly 
boosted massive jets is an important field in theoreti¬ 
cal and experimental particle physics nowadays and will 
continue to be so in the foreseeable future. The first rea¬ 
son why these jets are interesting is very practical. In 
the next run of the LHC, processes in which the mas¬ 
sive Standard Model (SM) particles, namely the top, W, 
Z and Higgs, are produced at large boost will become 
fairly common. The second is that in order to explore 
the unknown energy frontier and look for new physics 
characterized by mass scales higher than ever been stud¬ 
ied before, the searches typically face very energetic jets 
that sometimes originate from the massive SM degrees of 
freedom, and discriminating them from light (QCD) jets 
is an important task for any future high energy collider. 
The third is that in a very large class of SM extensions 
— motivated by either naturalness or simply the idea of 
minimal flavor violation — the expected robust signals 
are connected to events with boosted massive SM parti¬ 
cles and almost nothing else [1-4]. 

Just to have a concrete discussion, suppose we would 
like to examine in detail whether the discovered Higgs 
particle completely solves the SM unitarity problem or 
maybe it is not a pointlike particle and thus deviations 
from the SM predictions are expected (see Refs. [5-7] 
for recent related discussions). This basic test of the 
SM Higgs mechanism involves looking at the process 
WlWl —t VV (y = WhlZ^jh, with the subscript 
L denoting longitudinal polarization) at large invariant 
masses, mvv ^ ttiv ■ To have a reasonable measurement 
of mw^WL ) lo^st one of the VF’s should decay hadron- 
ically, resulting in a narrow massive jet. Such a jet would 
have a typical opening angle of the order of twice its mass 


divided by its transverse momentum, 9j ^ 2mj/pT- One 
might naively think that no fundamental problem arises 
when the boost is increased and 9j is decreasing. In the 
future all that would be required is to improve the HCAL 
granularity in the transverse direction such that the sizes 
of the basic hadronic cells divided by their distance from 
the interaction point will be smaller than 6j. Our main 
point is that this conclusion is incorrect because the inter¬ 
actions between hadrons and any known HCAL material 
produce a shower with a typical transverse size, dhad- For 
a given material and detector architecture one can then 
define a minimal angular size dhad below which the trans¬ 
verse jet substructure information in the HCAL will be 
washed out regardless of how fine the HCAL is made. We 
call superboosted massive jets the massive jets for which 
the typical opening angle is smaller than the hadronic 
shower angular size, 9j < dhad- Such jets will suffer from 
the fact that the part of the perturbative information 
carried by the effectively stable neutral particles, such as 
the neutron and K^, and for high boosts also Ks, A and 
cannot be recovered, as such particles are traceless 
outside the HCAL. 

Superboosted massive jets and finite hadronic 
shower size .—In a typical detector, the HCAL is built 
up to contain all hadrons produced in an event and mea¬ 
sure their energy and direction. Despite the complicated 
nature of the interaction of hadrons with material (for 
reviews, see, e.g., Refs. ]8-10]), a phenomenological de¬ 
scription of the average longitudinal and lateral sizes of 
the induced showers as a function of the hadron energy is 
available. The hadrons relevant to our discussion are very 
energetic. For example, for W jets with pr near 3 (10) 
TeV, the three leading long-lived hadrons carry on aver- 
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age energies of 1200 (2700), 700 (1500), 490 (1100) GeV, 
and the three leading neutral ones carry 600 (1330), 210 
(470), 80 (190) GeV [11]. Similar numbers are obtained 
for QCD jets. For hadrons in this range of hundreds of 
GeV, the dependence on the energy and species is rather 
mild [8, 12, 13]. The 95% longitudinal containment of 
hadronic shower cascades, Lg^%, which is the average 
calorimeter depth within which 95% of the hadronic cas¬ 
cade energy will be deposited, is described in terms of 
the nuclear interaction length, A^, as [8] 

L 95 % « (6.2 -f 0.8 ln(£;/100 GeV)) Aa • (1) 

The 95% lateral containment for hadronic cascades, dg^%, 
can also be expressed in terms of Aa [8], 

<^ 95 % ~ ■ ( 2 ) 

Smaller interaction lengths are obtained for materials 

with larger atomic weights, with Aa ~ 10, 11, 15, 17, 

17, 40 cm for tungsten, uranium, copper, iron, lead, and 
aluminum respectively, while scintillator materials typi¬ 
cally have larger interactions lengths. Effective interac¬ 
tion lengths of HCALs (composed of scintillator and stop¬ 
ping material) thus cannot be shorter than ~ 10 cm, with 
typical values, e.g. in ATLAS and CMS, and the proto¬ 
type future calorimeter CALICE [13], being 20-30 cm. 

One can then define a minimal scale, 

C^had ~ *^95% ) (8) 

below which the perturbative jet information becomes 
increasingly unresolvable in the HCAL due to overlap 
between the hadronic showers (see, e.g.. Ref. [14]). Thus, 
for any HCAL at a radial distance thcal from the beam 
axis, one can define a reference angular size, 0hadi below 
which the jet substructure information is expected to get 
lost, 

„ dhad „ 1 ^HCAL 2 m 

dhad « - « 0.1 X —- X - . (4) 

fHCAL 20 cm thcal 

While it seems very challenging to improve upon Arcal, 
it is in principle possible to decrease dhad by increasing 
the radial distance, thcal- A typical opening angle of 
a boosted t or W jet is 6t^w = ‘^mt^w/PT- Thus, as¬ 
suming Ahcal = 20 cm, to resolve the substructure of 
a 3(10) TeV jet the HCAL needs to be at a distance 
of at least thcal ~ 2,4(6,12) meters from the beam 
pipe. Note that it means that superboosted jets might 
become relevant already at the LHC, since the active in¬ 
ner radius of the HCAL is 2.3 m for ATLAS and 1.8 m for 
CMS. Furthermore, hadronic showers sometimes start al¬ 
ready in the electromagnetic calorimeter (ECAL), which 
has an inner radius of 1.4 (1.3) m in ATLAS (CMS). 
The calorimeter shower size may or may not be the most 
important limitation, since an angular size of about 0.1 
describes also the granularity of the ATLAS and CMS 


HCALs. However, future colliders are expected to have 
much better HCAL granularities (see, e.g., Ref. [12]), so 
the HCAL shower size will become the leading obsta¬ 
cle. While scaling up the detectors would eliminate the 
problem, this would be very costly, not only due to the 
increased HCAL volume but also due to the increased 
volume of the magnetic field for the muon detector. This 
will likely make such a solution unrealistic. 

Limitations of jet substructure variables with¬ 
out neutrals. — The results obtained above lead to the 
conclusion that in the future the energy frontier will al¬ 
most unavoidably have to deal with jets in the super- 
boosted regime. In this regime, jet substructure analyses 
will have to rely solely on information obtained by the 
tracker and ECAL. Methods using only tracker and/or 
ECAL information have already been explored in the lit¬ 
erature [15-21]. Here we take a somewhat orthogonal 
path and attempt to characterize the unavoidable fluctu¬ 
ations that arise in (practically all) jet substructure vari¬ 
ables due to the spatially unresolvable energy depositions 
of the neutral hadrons. (A note on terminology: in re¬ 
alistic situations, each “PE neutral” object of CMS [21] 
contains energy depositions of multiple almost-collinear 
hadrons produced in the showering and hadronization of 
the same parton. This commonly includes the purely 
electromagnetic tt^’s. Our discussion assumes such elec¬ 
tromagnetic depositions to be perfectly resolvable, and 
focuses on the long-lived neutral hadrons.) 

In the following, we simulate events using 
Pythia 8.205 [11] with the default settings, inter¬ 
faced with East Jet [22]. In a more detailed study, 
one would also check how the results change when 
varying the Pythia settings or using a different Monte 
Carlo (e.g., Sherpa [23]), to estimate the systematic 
uncertainties. However, as our goal in this Letter is 
not to study any particular jet substructure variable 
in detail, but to only use several simple variables to 
exemplify our points, we will stick to the default set¬ 
tings. We have checked, nevertheless, that changing the 
color reconnection model from the MPI-based original 
Pythia 8 scheme (the default choice) to the new more 
QCD-based scheme or the new gluon-move model, does 
not have any significant effect on the results presented 
below. 

In Pig. 1, we show the fraction of energy carried by 
neutrons, ATl’s, as well as all other neutral hadrons that 
due to a large boost happen to decay farther than 2 m 
from the beam axis, for boosted W and QCD jets with 
Pt = 3 and 10 TeV. These results are based on a simu¬ 
lation of WW and QCD events in 100 TeV pp collisions. 
We use as our defaults anti-fc^ jets [24] with cone size 
R = imw/pT = 0.08 (0.024). Smaller cones would fre¬ 
quently fail to capture the W decay products [25], while 
larger cones would increase the QCD background at mw 
since the average mass of a QCD jet is (mj) ~ ocsPtR, 
with the peak of the mj distribution (the Sudakov peak) 
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FIG. 1. Energy fractions carried by long-lived neutral hadrons 
in boosted W jets (solid blue) and QCD jets (dashed red) for 
Pt = S TeV (left) and 10 TeV (right). 

being somewhat lower. Below, we shall discuss additional 
impacts of using larger cones. The mean, (/at), and stan¬ 
dard deviation, J/at, of the energy fraction taken by the 
neutrals in the 3 (10) TeV boosted W and QCD jets are 
(in %) 

(^J,QCD) ^ ^ = 15,13 (15,13). 

( 5 ) 

It implies that tracker-l-ECAL based jets capture roughly 
85% ± 15% of the actual jet energy. For subjets, ob¬ 
tained by reclustering the jet constituents using the anti- 
kr algorithm with cone size i?subjet = (3/4) mw/PT, the 
means are similar to that of the whole jet, while the fluc¬ 
tuations are larger — by factors of 1.3-1.4 for each of the 
two leading subjets. We note in passing that the neutral 
fraction depends on the flavor composition of the boosted 
jet partonic origin. This can potentially be used as a dis¬ 
criminator in certain situations. For hadronic W, t and 
h —>• 66 10 TeV jets, we find 


A global correction cannot cancel the fluctuations of the 
individual subjets, flf. 

Let us consider, for example, the jet mass, which is 
among the simplest possible jet substructure variables. 
We will show that applying a global correction to the 
jet does not reduce the fluctuations. The jet mass for 
boosted 2-body hadronic decays of lT/Z/6 (signal) is 
dominated by just the two-prong kinematics, making it 
simple to describe. For QCD jets, the mass distribution 
depends on the jet cone size. We shall consider two cases 
in the context of QCD jets as background for W jets, 
for a fixed jet pp- (i) the W mass falls in the tail re¬ 
gion of the QCD jet mass distribution, away from the 
Sudakov peak, such that the two-prong approximation 
roughly holds (see, for instance. Refs. [29-31]) and (ii) 
the W mass is near or below the Sudakov peak, where 
the QCD jet mass is partially driven by resummation of 
multiple emissions (see, e.g., Refs. [32, 33] and references 
therein). 

The two-prong kinematics of a narrow jet can be 
fully described by its energy, E 12 = Ei + E 2 , mass, 
mi 2 = EiE 29 i 2 , and the energy fraction in the softer 
parton/subjet, z = E 2 /E 12 < Without the HCAL, 
one measures 

(1 -/ w)(l -/^)™12 . ( 7 ) 

where the subscript denotes that the neutrals are omit¬ 
ted. We have neglected a possible shift in 612 since the 
angular resolution of the tracker is very good and the 
subjets are very collimated. The global jet correction ac¬ 
counts for the average neutral fraction by rescaling the 
mass according to m .12 corr = ^12 jy x Ej/Ejjfj, where 
Ep jf[ is the energy of all the particles in the jet that can 
be measured using the tracker and ECAL, namely 


/ pW—>’Cs, W—^ud 

\Jn 

a t—>bcs, t—^bud 
N 

a h^bb 
N 


) =21,14, 
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6/a/ = 16,14, 

t^bud ^ ^ 2 , 11 , 

= 13. (6) 


^12,corr 


E. (1 - %) E^ 


'bni 2 ,ifi , 


( 8 ) 


where the sums are over all the subjets. At linear order in 
/at, /w, and y = (X), Ei - Ei - E 2 ) / Y.i Ei, we obtain 


As is well known, one can apply a global rescaling 
to correct for the missing neutral component based on 
the total jet energy, Ej, including the energy deposited 
in the HCAL. For recent discussions in the context of 
boosted jets, see [15, 16]. (For a formal theoretical 
treatment for QCD jets, see [26, 27].) Jet energy res¬ 
olution, which for instance in CMS is given roughly by 
a{Ej)/Ej « 1.0/v/Fj/GeV©0.05 [28], is only a minor 
limitation, since already for Ej > 50 GeV the associated 
fluctuations are below 15%. Now we would like to ar¬ 
gue that such a global correction does not compensate 
for fluctuations in jet substructure variables. The reason 
is very simple: jet substructure, by definition, character¬ 
izes some kinematic properties of the jet’s perturbative 
constituents, the subjets. However, each subjet is sub¬ 
ject to an independent fluctuation in the neutral fraction. 


mi 2 \Z J 

where ^ fhEi/J^ Ei, with the sums in start¬ 

ing from 1 = 3. For the mean values of the correc¬ 
tion is perfect if we neglect the last term and the weak 
dependence of (/^) on Ei. Statistical fluctuations lead 
to fluctuations in mi 2 ,corr ~ W 12 , 


[ S f ~ 

V V "*12 


+ {yrisf%+r+{f^+r{syr. 

( 10 ) 


Note that the size of the fluctuations is z dependent. It 
is interesting to see that it might be beneficial to cut 
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FIG. 2. Jet mass based on the two leading subjets: mi 2 
(truth value, required to be 75 ± 5 GeV, thin black), 
(without the neutrals, dotted green) and mi 2 ,corr (corrected, 
thick blue) for boosted W jets with px ~ 10 TeV, for cone 
sizes R = ‘imw Ivt (left) and R = l^mw /pt (right). In both 
cases 7?subjet = {?> j4) mw! Pt ■ 




FIG. 3. Standard deviation of the relative offset in the jet 
mass after the correction (for jets with truth mass in the 
range 75 ±5 GeV) as a function of z, for boosted W jets (thin 
blue) and QCD jets (thick red) with jet px = 3 TeV (solid) 
and 10 TeV (dotted). On the left we use jets with cone size 
R — Smwipx, while on the right we use large cones (the Su- 
dakov peak case). In both cases i?subjet = (3/4) mw jpx- The 
mean offset (not shown) is much smaller than the standard 
deviation. 


away the low z events as this would reduce the average 
fluctuation in the mass. 

In Fig. 2 we show the distributions of the truth jet 
mass, mi 2 , the mass without the neutrals, as well 

as the globally corrected one, TOi 2 ,corr, for W jets with 
Pt = 10 TeV. We focus on events where the W mass is 
indeed captured by the two leading subjets at the truth 
level by requiring mi 2 = 75±5 GeV. We see that the large 
fluctuations of TOi 2 ,corr relative to 77112 remain despite the 
correction. 

The fluctuations are most significant for low z, as 
shown in Fig. 3, consistent with the expectation from 
Eq. (10). Analogous results for jets from the QCD back¬ 
ground are also included. For the small cone (R = 
‘imy/I pt), ttie QCD jets are far from the Sudakov peak, 
and are therefore dominated by the two-prong approx¬ 
imation and exhibit the same z dependence. Differ¬ 
ently, when the QCD background jets are in the Sudakov 


peak region, which we obtain for large jet radii {R = 
9mw/pT = 0.24 for 3 TeV jets and R = Ibmw/PT = 
0.12 for 10 TeV jets), they are affected by a large num¬ 
ber of emissions, so we expect the correlation with the 
two-prong variable, z, to be rather weak. Figure 3 (right) 
confirms this expectation. 

The only other jet-substructure variable that is inde¬ 
pendent of the mass, for two-prong kinematics, is z itself. 
To leading order, after fixing the mass, W jets have a flat 
z distribution while for QCD jets it is proportional to 1/z 
for small z’s [29]. It is therefore possible to apply a lower 
cut on z to enhance the signal over the corresponding 
QCD background [34], or alternatively apply an upper 
cut on z to obtain a background-enriched sample to study 
massive QCD jets or have a control region. However, the 
impact of the lost neutrals on the signal and background 
efficiencies is quite minor as the z distributions of both 
the signal and background are pretty broad to start with. 
This is also being reflected by the fact that cutting on z 
is not particularly useful for rejecting the background. 

Zero-cone-size, longitudinal jet information .— 
Future HCALs are envisioned to have an improved granu¬ 
larity not only in the transverse but also in the longitudi¬ 
nal direction (see, e.g., Ref. [12]), allowing to measure the 
longitudinal energy deposition profile. In principle, the 
profile is sensitive to the energy depositions of individual 
hadrons. Separation between them is slightly aided by 
the fact that the shower starts at a random depth for each 
hadron. The relevant so-called pion interaction length is 
comparable to Aa [12]. Remarkably, the longitudinal in¬ 
formation is available even if the hadrons are completely 
collinear, when the conventional jet substructure vari¬ 
ables, all of which depend on transverse separation, are 
powerless. 

In practice, extracting individual contributions from a 
measured profile may be challenging, as there will still be 
a significant degree of shower overlap, the shower shapes 
vary significantly event-by-event [10, 35], and the granu¬ 
larity will still be a limiting factor. We will not analyze 
this in detail, but discuss how information obtained in 
this way can potentially be useful. 

If each hard parton produced one hard hadron and a 
few softer ones, the longitudinal prohle of a boosted W 
jet, for example, would typically contain two relatively 
large humps, while a QCD jet would lead to a single and 
more energetic one. That would likely be easy to see. In 
practice, each high-py parton produces several compara¬ 
bly energetic hadrons, so the picture is more complicated, 
but one might still hope that some information about the 
underlying partonic structure remains. One could imag¬ 
ine variables such as the px fraction carried by the leading 
hadron, or the number of hadrons one needs to sum to 
account for a certain fraction of the jet px- If one of the 
hadrons is a 7r°(—)• 77 ) and thus deposits all of its energy 
in the ECAL, it can be accounted for in a trivial way and 
only make the interpretation of the HCAL profile easier. 
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FIG. 4. Jet pt fractions carried by the leading hadron (left) 
and the second leading hadron (right) for boosted W jets 
(solid blue) vs. QCD jets (dashed red) for jet pt = 10 TeV. 


One might hope that the availability of the longitudi¬ 
nal profile makes the loss of transverse information less 
severe of an issue. However, we find that the longitudinal 
information, even at the truth level (be., before simulat¬ 
ing the HCAL showers) is quite limited, for the scenarios 
we analyzed in this paper. For example, Fig. 4 shows that 
the distributions of the px fractions of the two leading 
hadrons are quite similar for boosted W jets and QCD 
jets. We believe that the similarity is to a large extent 
accidental and there may exist other scenarios in which 
the longitudinal variables would be effective. One may 
also consider using such variables in other contexts, e.g., 
for distinguishing jets initiated by a quark in a certain 
process from those initiated by a gluon in another. It is 
clear though that they do not provide a general solution 
for the superboost regime. 

Overview. — When a hadron propagates in a mate¬ 
rial, it produces a shower of a finite size. This sets a lower 
bound on the angular scale that can be probed using 
HCALs in typical experiments. We defined superboosted 
jets as jets originating from energetic massive particles, 
with an opening angle smaller than this minimal angu¬ 
lar scale. HCALs are thus insensitive to substructure 
of superboosted jets, so the information carried by the 
effectively-stable neutral particles is unrecoverable. For 
simplicity, we have focused on two-prong variables, the 
jet mass and splitting fraction. We have demonstrated 
that fluctuations in the energy carried by the neutral par¬ 
ticles lead to a smearing of the resulting jet-substructure 
distribution. This qualitative feature is expected to be 
shared by more complicated substructure variables, as 
long as the signal distribution is narrow, where the fluc¬ 
tuations in the neutral energy fractions of the third and 
further subjets will play an important role. 

The superboosted regime is characterized by jets with 
large transverse momenta. In that regime, depending on 
the size of the jets, the average mass of QCD jets can be 
either much smaller or larger than the mass of the signal 
jets produced from hadronic decays of, say, the heavy SM 
particles, IF/Z//i/t. The latter case is particularly inter¬ 


esting as the QCD jets and the ones originating from sig¬ 
nal events would behave in a qualitatively different way. 
While the QCD jets will have a rich internal structure, 
WlZjh (and possibly t) superboosted jets will be much 
sparser. This is somewhat similar to the difference be¬ 
tween QCD and hadronic t jets measured presently at 
the LHC. The fundamental difference in the nature of 
the signal and background jets may turn out useful for 
suppressing the QCD background when searching for su¬ 
perboosted massive jets, an idea that merits a dedicated 
study. 

Finally, we have shown that the fraction of neutrals has 
a sizable flavor dependence, especially evident when com¬ 
paring the decay of a boosted W to second-generation 
quarks with its decay to first-generation quarks or with a 
QCD jet. This could be used as another handle to distin¬ 
guish signal superboosted jets from their corresponding 
QCD backgrounds. Distributions of the neutral fractions 
for various jet flavors can in fact be measured in tt events, 
with and without charm tagging of jets from W decays. 
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